Dataset statistics
| Statistic | Value |
|---|---|
| Number of variables | 13 |
| Number of observations | 4765 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 502.6 KiB |
| Average record size in memory | 108.0 B |
Variable types
| Type | Count |
|---|---|
| DateTime | 1 |
| Categorical | 3 |
| Numeric | 8 |
| Unsupported | 1 |
Alerts
| Alert | Type |
|---|---|
| location has a high cardinality: 3930 distinct values | High cardinality |
| operator has a high cardinality: 2201 distinct values | High cardinality |
| ac_type has a high cardinality: 2370 distinct values | High cardinality |
| passenger_aboard is highly overall correlated with passenger_fatalities | High correlation |
| crew_aboard is highly overall correlated with crew_fatalities | High correlation |
| passenger_fatalities is highly overall correlated with passenger_aboard | High correlation |
| crew_fatalities is highly overall correlated with crew_aboard | High correlation |
| decade is highly overall correlated with year | High correlation |
| year is highly overall correlated with decade | High correlation |
| location is uniformly distributed | Uniform |
| ground is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| passenger_aboard has 869 (18.2%) zeros | Zeros |
| all_fatalities has 74 (1.6%) zeros | Zeros |
| passenger_fatalities has 1039 (21.8%) zeros | Zeros |
| crew_fatalities has 398 (8.4%) zeros | Zeros |
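The percentages quoted in the alerts appear to be plain shares of the 4765 observations. The sketch below recomputes them from the counts listed above; the helper name `share` is hypothetical and one-decimal rounding is an assumption about how the report formats its figures.

```python
# Counts copied from the alerts above; N is the number of observations.
N = 4765
zero_counts = {
    "passenger_aboard": 869,
    "all_fatalities": 74,
    "passenger_fatalities": 1039,
    "crew_fatalities": 398,
}
distinct_counts = {"location": 3930, "operator": 2201, "ac_type": 2370}


def share(count, total=N):
    """Return count/total as a percentage rounded to one decimal place."""
    return round(100 * count / total, 1)


# Share of rows that are zero, and share of rows that are distinct values.
zero_share = {name: share(c) for name, c in zero_counts.items()}
distinct_share = {name: share(c) for name, c in distinct_counts.items()}

print(zero_share)
print(distinct_share)
```

The distinct shares (82.5%, 46.2%, 49.7%) match the "Distinct (%)" rows reported for each categorical variable further down.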
Reproduction
| Analysis started | 2023-05-24 08:12:23.706989 |
|---|---|
| Analysis finished | 2023-05-24 08:13:19.625437 |
| Duration | 55.92 seconds |
| Software version | ydata-profiling v4.1.2 |
| Download configuration | config.json |
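As a quick consistency check, the reported duration can be recomputed from the two timestamps. This is a minimal sketch using only the values in the table above; the format string is an assumption about how the timestamps are laid out.

```python
from datetime import datetime

# Timestamp layout assumed from the values shown in the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-05-24 08:12:23.706989", FMT)
finished = datetime.strptime("2023-05-24 08:13:19.625437", FMT)

# Elapsed wall-clock time between start and finish, in seconds.
duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```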
datetime
Date
| Distinct | 4366 |
|---|---|
| Distinct (%) | 91.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 74.5 KiB |
| Minimum | 1908-09-17 00:00:00 |
|---|---|
| Maximum | 2021-07-06 00:00:00 |
location
Categorical
HIGH CARDINALITY  UNIFORM 
| Distinct | 3930 |
|---|---|
| Distinct (%) | 82.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 74.5 KiB |
| Manila, Philippines | 15 |
|---|---|
| Moscow, Russia | 15 |
| New York, New York | 14 |
| Rio de Janeiro, Brazil | 12 |
| Cairo, Egypt | 12 |
| Other values (3925) | 4697 |
Length
| Max length | 72 |
|---|---|
| Median length | 51 |
| Mean length | 20.823924 |
| Min length | 1 |
Characters and Unicode
| Total characters | 99226 |
|---|---|
| Distinct characters | 90 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 3514 |
|---|---|
| Unique (%) | 73.7% |
Sample
| 1st row | Fort Myer, Virginia |
|---|---|
| 2nd row | Juvisy-sur-Orge, France |
| 3rd row | Atlantic City, New Jersey |
| 4th row | Victoria, British Columbia, Canada |
| 5th row | Tienen, Belgium |
Common Values
| Value | Count | Frequency (%) |
| Manila, Philippines | 15 | 0.3% |
| Moscow, Russia | 15 | 0.3% |
| New York, New York | 14 | 0.3% |
| Rio de Janeiro, Brazil | 12 | 0.3% |
| Cairo, Egypt | 12 | 0.3% |
| Sao Paulo, Brazil | 12 | 0.3% |
| Bogota, Colombia | 12 | 0.3% |
| Chicago, Illinois | 10 | 0.2% |
| Near Moscow, Russia | 10 | 0.2% |
| Tehran, Iran | 9 | 0.2% |
| Other values (3920) | 4644 |
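The "Other values" row can be derived from the rest of the table: it is the number of observations minus the counts of the listed values. A small sketch for the location table, assuming the 4765 observations from the dataset statistics:

```python
# Top-10 location counts copied from the Common Values table above.
N = 4765
top_counts = [15, 15, 14, 12, 12, 12, 12, 10, 10, 9]
distinct_total = 3930

# Rows not covered by the listed values, and how many distinct values they span.
other_rows = N - sum(top_counts)
other_distinct = distinct_total - len(top_counts)

print(other_rows, other_distinct)
```

The same subtraction reproduces the "Other values" counts in the operator and ac_type tables.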
Words
| Value | Count | Frequency (%) |
| near | 1279 | 9.2% |
| off | 319 | 2.3% |
| russia | 250 | 1.8% |
| new | 224 | 1.6% |
| brazil | 169 | 1.2% |
| colombia | 153 | 1.1% |
| canada | 127 | 0.9% |
| france | 118 | 0.8% |
| california | 115 | 0.8% |
| mexico | 110 | 0.8% |
| Other values (3994) | 11097 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 12410 | 12.5% |
| (space) | 9244 | 9.3% |
| e | 6733 | 6.8% |
| i | 6301 | 6.4% |
| n | 6205 | 6.3% |
| r | 5734 | 5.8% |
| o | 5150 | 5.2% |
| , | 4985 | 5.0% |
| l | 3826 | 3.9% |
| s | 3393 | 3.4% |
| Other values (80) | 35245 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 70598 | |
| Uppercase Letter | 14041 | 14.2% |
| Space Separator | 9245 | 9.3% |
| Other Punctuation | 5136 | 5.2% |
| Dash Punctuation | 98 | 0.1% |
| Decimal Number | 66 | 0.1% |
| Control | 20 | < 0.1% |
| Close Punctuation | 11 | < 0.1% |
| Open Punctuation | 11 | < 0.1% |
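The category counts should add up to the total character count for location (99226), and each frequency is the category's share of that total. The sketch below checks both, which also recovers the percentage that is blank in the first row; one-decimal rounding is an assumption about the report's formatting.

```python
# Character counts per Unicode category, copied from the table above.
category_counts = {
    "Lowercase Letter": 70598,
    "Uppercase Letter": 14041,
    "Space Separator": 9245,
    "Other Punctuation": 5136,
    "Dash Punctuation": 98,
    "Decimal Number": 66,
    "Control": 20,
    "Close Punctuation": 11,
    "Open Punctuation": 11,
}

# Total characters across all categories.
total_characters = sum(category_counts.values())

# Each category's share of the total, as a percentage with one decimal.
category_share = {
    name: round(100 * count / total_characters, 1)
    for name, count in category_counts.items()
}

print(total_characters)
print(category_share["Lowercase Letter"])
```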
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12410 | |
| e | 6733 | |
| i | 6301 | |
| n | 6205 | |
| r | 5734 | 8.1% |
| o | 5150 | 7.3% |
| l | 3826 | 5.4% |
| s | 3393 | 4.8% |
| t | 2949 | 4.2% |
| u | 2629 | 3.7% |
| Other values (31) | 15268 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1932 | |
| C | 1405 | 10.0% |
| S | 1073 | 7.6% |
| M | 961 | 6.8% |
| B | 909 | 6.5% |
| A | 861 | 6.1% |
| P | 760 | 5.4% |
| I | 687 | 4.9% |
| R | 638 | 4.5% |
| O | 541 | 3.9% |
| Other values (17) | 4274 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 24 | |
| 1 | 15 | |
| 2 | 9 | 13.6% |
| 5 | 8 | 12.1% |
| 8 | 3 | 4.5% |
| 9 | 2 | 3.0% |
| 3 | 2 | 3.0% |
| 7 | 2 | 3.0% |
| 6 | 1 | 1.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4985 | |
| . | 115 | 2.2% |
| ' | 24 | 0.5% |
| / | 6 | 0.1% |
| ? | 5 | 0.1% |
| : | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9244 | |
| (no-break space, U+00A0) | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| (unprintable control character) | 15 | |
| (unprintable control character) | 5 | 25.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 98 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 11 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 84639 | |
| Common | 14587 | 14.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 12410 | |
| e | 6733 | 8.0% |
| i | 6301 | 7.4% |
| n | 6205 | 7.3% |
| r | 5734 | 6.8% |
| o | 5150 | 6.1% |
| l | 3826 | 4.5% |
| s | 3393 | 4.0% |
| t | 2949 | 3.5% |
| u | 2629 | 3.1% |
| Other values (58) | 29309 |
Common
| Value | Count | Frequency (%) |
| (space) | 9244 | |
| , | 4985 | |
| . | 115 | 0.8% |
| - | 98 | 0.7% |
| 0 | 24 | 0.2% |
| ' | 24 | 0.2% |
| (unprintable control character) | 15 | 0.1% |
| 1 | 15 | 0.1% |
| ) | 11 | 0.1% |
| ( | 11 | 0.1% |
| Other values (12) | 45 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 99186 | |
| None | 40 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 12410 | 12.5% |
| (space) | 9244 | 9.3% |
| e | 6733 | 6.8% |
| i | 6301 | 6.4% |
| n | 6205 | 6.3% |
| r | 5734 | 5.8% |
| o | 5150 | 5.2% |
| , | 4985 | 5.0% |
| l | 3826 | 3.9% |
| s | 3393 | 3.4% |
| Other values (63) | 35205 |
None
| Value | Count | Frequency (%) |
| é | 12 | |
| ö | 5 | |
| ó | 4 | 10.0% |
| Ã | 4 | 10.0% |
| ï | 2 | 5.0% |
| á | 2 | 5.0% |
| è | 1 | 2.5% |
| ô | 1 | 2.5% |
| Ã | 1 | 2.5% |
| ä | 1 | 2.5% |
| Other values (7) | 7 |
operator
Categorical
| Distinct | 2201 |
|---|---|
| Distinct (%) | 46.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 74.5 KiB |
| Aeroflot | 247 |
|---|---|
| Military - U.S. Air Force | 132 |
| Air France | 65 |
| Deutsche Lufthansa | 63 |
| United Air Lines | 44 |
| Other values (2196) | 4214 |
Length
| Max length | 65 |
|---|---|
| Median length | 48 |
| Mean length | 18.701364 |
| Min length | 1 |
Characters and Unicode
| Total characters | 89112 |
|---|---|
| Distinct characters | 87 |
| Distinct categories | 9 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 1695 |
|---|---|
| Unique (%) | 35.6% |
Sample
| 1st row | Military - U.S. Army |
|---|---|
| 2nd row | ? |
| 3rd row | Military - U.S. Navy |
| 4th row | Private |
| 5th row | Military - German Navy |
Common Values
| Value | Count | Frequency (%) |
| Aeroflot | 247 | 5.2% |
| Military - U.S. Air Force | 132 | 2.8% |
| Air France | 65 | 1.4% |
| Deutsche Lufthansa | 63 | 1.3% |
| United Air Lines | 44 | 0.9% |
| Military - U.S. Army Air Forces | 43 | 0.9% |
| Pan American World Airways | 40 | 0.8% |
| China National Aviation Corporation | 37 | 0.8% |
| American Airlines | 36 | 0.8% |
| US Aerial Mail Service | 34 | 0.7% |
| Other values (2191) | 4024 |
Words
| Value | Count | Frequency (%) |
| air | 1371 | 10.2% |
| airlines | 831 | 6.2% |
| | 823 | 6.1% |
| military | 638 | 4.7% |
| force | 476 | 3.5% |
| airways | 437 | 3.2% |
| u.s | 263 | 2.0% |
| aeroflot | 259 | 1.9% |
| lines | 183 | 1.4% |
| aviation | 136 | 1.0% |
| Other values (2039) | 8037 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 9582 | 10.8% |
| (space) | 8709 | 9.8% |
| r | 8276 | 9.3% |
| a | 7317 | 8.2% |
| e | 6494 | 7.3% |
| n | 5312 | 6.0% |
| A | 4842 | 5.4% |
| o | 4142 | 4.6% |
| s | 3878 | 4.4% |
| l | 3826 | 4.3% |
| Other values (77) | 26734 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 64380 | |
| Uppercase Letter | 14184 | 15.9% |
| Space Separator | 8710 | 9.8% |
| Dash Punctuation | 793 | 0.9% |
| Other Punctuation | 785 | 0.9% |
| Open Punctuation | 113 | 0.1% |
| Close Punctuation | 113 | 0.1% |
| Decimal Number | 26 | < 0.1% |
| Control | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 9582 | |
| r | 8276 | |
| a | 7317 | |
| e | 6494 | |
| n | 5312 | |
| o | 4142 | |
| s | 3878 | 6.0% |
| l | 3826 | 5.9% |
| t | 3688 | 5.7% |
| c | 1865 | 2.9% |
| Other values (28) | 10000 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4842 | |
| S | 1071 | 7.6% |
| M | 1057 | 7.5% |
| C | 870 | 6.1% |
| F | 805 | 5.7% |
| T | 662 | 4.7% |
| L | 652 | 4.6% |
| P | 494 | 3.5% |
| U | 485 | 3.4% |
| N | 456 | 3.2% |
| Other values (16) | 2790 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 7 | 4 | |
| 4 | 4 | |
| 8 | 2 | 7.7% |
| 6 | 2 | 7.7% |
| 9 | 2 | 7.7% |
| 5 | 2 | 7.7% |
| 2 | 2 | 7.7% |
| 1 | 2 | 7.7% |
| 3 | 1 | 3.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 639 | |
| / | 99 | 12.6% |
| ' | 23 | 2.9% |
| , | 10 | 1.3% |
| ? | 8 | 1.0% |
| & | 6 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8709 | |
| (no-break space, U+00A0) | 1 | < 0.1% |
Control
| Value | Count | Frequency (%) |
| (unprintable control character) | 6 | |
| (unprintable control character) | 2 | 25.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 793 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 113 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 113 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 78564 | |
| Common | 10548 | 11.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 9582 | |
| r | 8276 | 10.5% |
| a | 7317 | 9.3% |
| e | 6494 | 8.3% |
| n | 5312 | 6.8% |
| A | 4842 | 6.2% |
| o | 4142 | 5.3% |
| s | 3878 | 4.9% |
| l | 3826 | 4.9% |
| t | 3688 | 4.7% |
| Other values (54) | 21207 |
Common
| Value | Count | Frequency (%) |
| (space) | 8709 | |
| - | 793 | 7.5% |
| . | 639 | 6.1% |
| ( | 113 | 1.1% |
| ) | 113 | 1.1% |
| / | 99 | 0.9% |
| ' | 23 | 0.2% |
| , | 10 | 0.1% |
| ? | 8 | 0.1% |
| (unprintable control character) | 6 | 0.1% |
| Other values (13) | 35 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 88993 | |
| None | 119 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 9582 | 10.8% |
| (space) | 8709 | 9.8% |
| r | 8276 | 9.3% |
| a | 7317 | 8.2% |
| e | 6494 | 7.3% |
| n | 5312 | 6.0% |
| A | 4842 | 5.4% |
| o | 4142 | 4.7% |
| s | 3878 | 4.4% |
| l | 3826 | 4.3% |
| Other values (64) | 26615 |
None
| Value | Count | Frequency (%) |
| é | 99 | |
| á | 5 | 4.2% |
| Ã | 2 | 1.7% |
| Ã | 2 | 1.7% |
| ï | 2 | 1.7% |
| ó | 2 | 1.7% |
| ú | 1 | 0.8% |
| Â | 1 | 0.8% |
| ã | 1 | 0.8% |
| è | 1 | 0.8% |
| Other values (3) | 3 | 2.5% |
ac_type
Categorical
| Distinct | 2370 |
|---|---|
| Distinct (%) | 49.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 74.5 KiB |
| Douglas DC-3 | 314 |
|---|---|
| de Havilland Canada DHC-6 Twin Otter 300 | 81 |
| Douglas C-47A | 69 |
| Douglas C-47 | 57 |
| Douglas DC-4 | 40 |
| Other values (2365) | 4204 |
Length
| Max length | 42 |
|---|---|
| Median length | 36 |
| Mean length | 18.528856 |
| Min length | 1 |
Characters and Unicode
| Total characters | 88290 |
|---|---|
| Distinct characters | 76 |
| Distinct categories | 10 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 1784 |
|---|---|
| Unique (%) | 37.4% |
Sample
| 1st row | Wright Flyer III |
|---|---|
| 2nd row | Wright Byplane |
| 3rd row | Dirigible |
| 4th row | Curtiss seaplane |
| 5th row | Zeppelin L-8 (airship) |
Common Values
| Value | Count | Frequency (%) |
| Douglas DC-3 | 314 | 6.6% |
| de Havilland Canada DHC-6 Twin Otter 300 | 81 | 1.7% |
| Douglas C-47A | 69 | 1.4% |
| Douglas C-47 | 57 | 1.2% |
| Douglas DC-4 | 40 | 0.8% |
| Yakovlev YAK-40 | 35 | 0.7% |
| Antonov AN-26 | 31 | 0.7% |
| Junkers JU-52/3m | 29 | 0.6% |
| De Havilland DH-4 | 27 | 0.6% |
| Douglas DC-6B | 27 | 0.6% |
| Other values (2360) | 4055 |
Words
| Value | Count | Frequency (%) |
| douglas | 1085 | 8.4% |
| boeing | 402 | 3.1% |
| dc-3 | 367 | 2.8% |
| lockheed | 315 | 2.4% |
| de | 294 | 2.3% |
| havilland | 292 | 2.3% |
| antonov | 273 | 2.1% |
| canada | 159 | 1.2% |
| otter | 146 | 1.1% |
| ilyushin | 140 | 1.1% |
| Other values (2442) | 9494 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 8231 | 9.3% |
| - | 4960 | 5.6% |
| e | 4597 | 5.2% |
| a | 4474 | 5.1% |
| o | 4394 | 5.0% |
| n | 3716 | 4.2% |
| l | 3489 | 4.0% |
| i | 3223 | 3.7% |
| r | 3132 | 3.5% |
| C | 2927 | 3.3% |
| Other values (66) | 45147 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44080 | |
| Uppercase Letter | 17142 | 19.4% |
| Decimal Number | 13340 | 15.1% |
| Space Separator | 8232 | 9.3% |
| Dash Punctuation | 4960 | 5.6% |
| Other Punctuation | 240 | 0.3% |
| Open Punctuation | 146 | 0.2% |
| Close Punctuation | 145 | 0.2% |
| Math Symbol | 3 | < 0.1% |
| Control | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 4597 | |
| a | 4474 | |
| o | 4394 | |
| n | 3716 | 8.4% |
| l | 3489 | 7.9% |
| i | 3223 | 7.3% |
| r | 3132 | 7.1% |
| s | 2765 | 6.3% |
| t | 2241 | 5.1% |
| u | 2129 | 4.8% |
| Other values (18) | 9920 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2927 | |
| D | 2728 | |
| A | 1834 | |
| B | 1657 | |
| H | 956 | 5.6% |
| L | 823 | 4.8% |
| F | 778 | 4.5% |
| S | 728 | 4.2% |
| I | 628 | 3.7% |
| T | 606 | 3.5% |
| Other values (16) | 3477 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2092 | |
| 0 | 2054 | |
| 1 | 1945 | |
| 4 | 1646 | |
| 3 | 1625 | |
| 7 | 1449 | |
| 6 | 849 | |
| 5 | 688 | 5.2% |
| 8 | 635 | 4.8% |
| 9 | 357 | 2.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 163 | |
| . | 70 | |
| ? | 4 | 1.7% |
| , | 2 | 0.8% |
| & | 1 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8231 | |
| (no-break space, U+00A0) | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4960 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 146 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 145 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3 |
Control
| Value | Count | Frequency (%) |
| (unprintable control character) | 2 | |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 61222 | |
| Common | 27068 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 4597 | 7.5% |
| a | 4474 | 7.3% |
| o | 4394 | 7.2% |
| n | 3716 | 6.1% |
| l | 3489 | 5.7% |
| i | 3223 | 5.3% |
| r | 3132 | 5.1% |
| C | 2927 | 4.8% |
| s | 2765 | 4.5% |
| D | 2728 | 4.5% |
| Other values (44) | 25777 |
Common
| Value | Count | Frequency (%) |
| (space) | 8231 | |
| - | 4960 | |
| 2 | 2092 | 7.7% |
| 0 | 2054 | 7.6% |
| 1 | 1945 | 7.2% |
| 4 | 1646 | 6.1% |
| 3 | 1625 | 6.0% |
| 7 | 1449 | 5.4% |
| 6 | 849 | 3.1% |
| 5 | 688 | 2.5% |
| Other values (12) | 1529 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 88275 | |
| None | 15 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 8231 | 9.3% |
| - | 4960 | 5.6% |
| e | 4597 | 5.2% |
| a | 4474 | 5.1% |
| o | 4394 | 5.0% |
| n | 3716 | 4.2% |
| l | 3489 | 4.0% |
| i | 3223 | 3.7% |
| r | 3132 | 3.5% |
| C | 2927 | 3.3% |
| Other values (63) | 45132 |
None
| Value | Count | Frequency (%) |
| é | 11 | |
| è | 3 | 20.0% |
| Â | 1 | 6.7% |
all_aboard
Real number (ℝ)
| Distinct | 243 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 31.364953 |
| Minimum | 0 |
|---|---|
| Maximum | 644 |
| Zeros | 5 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 6 |
| median | 16 |
| Q3 | 35 |
| 95-th percentile | 119 |
| Maximum | 644 |
| Range | 644 |
| Interquartile range (IQR) | 29 |
Descriptive statistics
| Standard deviation | 46.131898 |
|---|---|
| Coefficient of variation (CV) | 1.4708104 |
| Kurtosis | 23.407024 |
| Mean | 31.364953 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 3.8770721 |
| Sum | 149454 |
| Variance | 2128.152 |
| Monotonicity | Not monotonic |
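Several of the all_aboard statistics are tied together by simple identities: the mean is the sum over the number of observations, the coefficient of variation is the standard deviation over the mean, the variance is the squared standard deviation, and the IQR is Q3 minus Q1. The sketch below checks the reported figures against each other. It uses the rounded values printed above, so agreement is only expected to within rounding error.

```python
# Values copied from the all_aboard tables above (rounded as reported).
n_observations = 4765
total = 149454
std = 46.131898
q1, q3 = 6, 35
minimum, maximum = 0, 644

mean = total / n_observations   # reported as 31.364953
cv = std / mean                 # reported as 1.4708104
variance = std ** 2             # reported as 2128.152
iqr = q3 - q1                   # reported as 29
value_range = maximum - minimum # reported as 644

print(f"mean={mean:.6f} cv={cv:.7f} variance={variance:.3f}")
print(f"iqr={iqr} range={value_range}")
```

The same identities can be applied to the other numeric variables in the report.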
| Value | Count | Frequency (%) |
| 3 | 278 | 5.8% |
| 2 | 239 | 5.0% |
| 4 | 199 | 4.2% |
| 5 | 187 | 3.9% |
| 6 | 172 | 3.6% |
| 10 | 171 | 3.6% |
| 7 | 162 | 3.4% |
| 1 | 137 | 2.9% |
| 9 | 123 | 2.6% |
| 11 | 118 | 2.5% |
| Other values (233) | 2979 |
| Value | Count | Frequency (%) |
| 0 | 5 | 0.1% |
| 1 | 137 | |
| 2 | 239 | |
| 3 | 278 | |
| 4 | 199 | |
| 5 | 187 | |
| 6 | 172 | |
| 7 | 162 | |
| 8 | 117 | |
| 9 | 123 |
| Value | Count | Frequency (%) |
| 644 | 1 | |
| 524 | 1 | |
| 517 | 1 | |
| 394 | 1 | |
| 393 | 1 | |
| 384 | 1 | |
| 356 | 1 | |
| 349 | 1 | |
| 346 | 1 | |
| 340 | 1 |
passenger_aboard
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 234 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 26.860021 |
| Minimum | 0 |
|---|---|
| Maximum | 614 |
| Zeros | 869 |
| Zeros (%) | 18.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 12 |
| Q3 | 30 |
| 95-th percentile | 112 |
| Maximum | 614 |
| Range | 614 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 44.099535 |
|---|---|
| Coefficient of variation (CV) | 1.641828 |
| Kurtosis | 24.155408 |
| Mean | 26.860021 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 3.9368972 |
| Sum | 127988 |
| Variance | 1944.769 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 869 | 18.2% |
| 4 | 170 | 3.6% |
| 2 | 161 | 3.4% |
| 5 | 140 | 2.9% |
| 3 | 129 | 2.7% |
| 7 | 129 | 2.7% |
| 9 | 128 | 2.7% |
| 10 | 127 | 2.7% |
| 8 | 124 | 2.6% |
| 1 | 120 | 2.5% |
| Other values (224) | 2668 |
| Value | Count | Frequency (%) |
| 0 | 869 | |
| 1 | 120 | 2.5% |
| 2 | 161 | 3.4% |
| 3 | 129 | 2.7% |
| 4 | 170 | 3.6% |
| 5 | 140 | 2.9% |
| 6 | 109 | 2.3% |
| 7 | 129 | 2.7% |
| 8 | 124 | 2.6% |
| 9 | 128 | 2.7% |
| Value | Count | Frequency (%) |
| 614 | 1 | |
| 509 | 1 | |
| 503 | 1 | |
| 381 | 1 | |
| 374 | 1 | |
| 364 | 1 | |
| 338 | 1 | |
| 335 | 1 | |
| 327 | 1 | |
| 316 | 1 |
crew_aboard
Real number (ℝ)
| Distinct | 34 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.5150052 |
| Minimum | 0 |
|---|---|
| Maximum | 83 |
| Zeros | 7 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 11 |
| Maximum | 83 |
| Range | 83 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 3.7613019 |
|---|---|
| Coefficient of variation (CV) | 0.83306702 |
| Kurtosis | 62.989402 |
| Mean | 4.5150052 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 4.9721286 |
| Sum | 21514 |
| Variance | 14.147392 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 951 | |
| 2 | 827 | |
| 4 | 685 | |
| 1 | 534 | |
| 5 | 513 | |
| 6 | 372 | 7.8% |
| 7 | 244 | 5.1% |
| 8 | 171 | 3.6% |
| 9 | 114 | 2.4% |
| 10 | 93 | 2.0% |
| Other values (24) | 261 | 5.5% |
| Value | Count | Frequency (%) |
| 0 | 7 | 0.1% |
| 1 | 534 | |
| 2 | 827 | |
| 3 | 951 | |
| 4 | 685 | |
| 5 | 513 | |
| 6 | 372 | 7.8% |
| 7 | 244 | 5.1% |
| 8 | 171 | 3.6% |
| 9 | 114 | 2.4% |
| Value | Count | Frequency (%) |
| 83 | 1 | < 0.1% |
| 61 | 1 | < 0.1% |
| 49 | 1 | < 0.1% |
| 43 | 1 | < 0.1% |
| 41 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 25 | 4 |
all_fatalities
Real number (ℝ)
| Distinct | 199 |
|---|---|
| Distinct (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 22.481847 |
| Minimum | 0 |
|---|---|
| Maximum | 583 |
| Zeros | 74 |
| Zeros (%) | 1.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 11 |
| Q3 | 25 |
| 95-th percentile | 87 |
| Maximum | 583 |
| Range | 583 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 35.676795 |
|---|---|
| Coefficient of variation (CV) | 1.5869157 |
| Kurtosis | 35.623846 |
| Mean | 22.481847 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 4.5569371 |
| Sum | 107126 |
| Variance | 1272.8337 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 367 | 7.7% |
| 2 | 367 | 7.7% |
| 3 | 359 | 7.5% |
| 4 | 239 | 5.0% |
| 5 | 232 | 4.9% |
| 6 | 168 | 3.5% |
| 7 | 157 | 3.3% |
| 10 | 150 | 3.1% |
| 8 | 125 | 2.6% |
| 13 | 124 | 2.6% |
| Other values (189) | 2477 |
| Value | Count | Frequency (%) |
| 0 | 74 | 1.6% |
| 1 | 367 | |
| 2 | 367 | |
| 3 | 359 | |
| 4 | 239 | |
| 5 | 232 | |
| 6 | 168 | |
| 7 | 157 | |
| 8 | 125 | 2.6% |
| 9 | 120 | 2.5% |
| Value | Count | Frequency (%) |
| 583 | 1 | |
| 520 | 1 | |
| 349 | 1 | |
| 346 | 1 | |
| 329 | 1 | |
| 301 | 1 | |
| 298 | 1 | |
| 290 | 1 | |
| 275 | 1 | |
| 271 | 1 |
passenger_fatalities
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 190 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.967051 |
| Minimum | 0 |
|---|---|
| Maximum | 560 |
| Zeros | 1039 |
| Zeros (%) | 21.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 8 |
| Q3 | 21 |
| 95-th percentile | 81 |
| Maximum | 560 |
| Range | 560 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 34.087024 |
|---|---|
| Coefficient of variation (CV) | 1.7971705 |
| Kurtosis | 36.901251 |
| Mean | 18.967051 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 4.642977 |
| Sum | 90378 |
| Variance | 1161.9252 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1039 | |
| 1 | 304 | 6.4% |
| 2 | 262 | 5.5% |
| 3 | 192 | 4.0% |
| 4 | 185 | 3.9% |
| 5 | 139 | 2.9% |
| 6 | 133 | 2.8% |
| 8 | 126 | 2.6% |
| 7 | 126 | 2.6% |
| 9 | 118 | 2.5% |
| Other values (180) | 2141 |
| Value | Count | Frequency (%) |
| 0 | 1039 | |
| 1 | 304 | 6.4% |
| 2 | 262 | 5.5% |
| 3 | 192 | 4.0% |
| 4 | 185 | 3.9% |
| 5 | 139 | 2.9% |
| 6 | 133 | 2.8% |
| 7 | 126 | 2.6% |
| 8 | 126 | 2.6% |
| 9 | 118 | 2.5% |
| Value | Count | Frequency (%) |
| 560 | 1 | |
| 505 | 1 | |
| 335 | 1 | |
| 316 | 1 | |
| 307 | 1 | |
| 287 | 1 | |
| 283 | 1 | |
| 278 | 1 | |
| 258 | 1 | |
| 257 | 1 |
crew_fatalities
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 28 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5867786 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 398 |
| Zeros (%) | 8.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 9 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 3.1747775 |
|---|---|
| Coefficient of variation (CV) | 0.88513339 |
| Kurtosis | 12.925451 |
| Mean | 3.5867786 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 2.5035007 |
| Sum | 17091 |
| Variance | 10.079212 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 892 | |
| 3 | 823 | |
| 1 | 769 | |
| 4 | 590 | |
| 5 | 402 | |
| 0 | 398 | |
| 6 | 273 | 5.7% |
| 7 | 171 | 3.6% |
| 8 | 130 | 2.7% |
| 9 | 86 | 1.8% |
| Other values (18) | 231 | 4.8% |
| Value | Count | Frequency (%) |
| 0 | 398 | |
| 1 | 769 | |
| 2 | 892 | |
| 3 | 823 | |
| 4 | 590 | |
| 5 | 402 | |
| 6 | 273 | 5.7% |
| 7 | 171 | 3.6% |
| 8 | 130 | 2.7% |
| 9 | 86 | 1.8% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 25 | 2 | < 0.1% |
| 23 | 6 | |
| 22 | 5 | |
| 21 | 2 | < 0.1% |
| 20 | 3 | |
| 19 | 5 | |
| 18 | 3 |
ground
Unsupported
REJECTED  UNSUPPORTED 
| Missing | 0 |
|---|---|
| Missing (%) | 0.0% |
| Memory size | 74.5 KiB |
decade
Real number (ℝ)
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1966.6065 |
| Minimum | 1900 |
|---|---|
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 55.8 KiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1930 |
| Q1 | 1950 |
| median | 1970 |
| Q3 | 1990 |
| 95-th percentile | 2010 |
| Maximum | 2020 |
| Range | 120 |
| Interquartile range (IQR) | 40 |
Descriptive statistics
| Standard deviation | 24.749065 |
|---|---|
| Coefficient of variation (CV) | 0.012584655 |
| Kurtosis | -0.94767328 |
| Mean | 1966.6065 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | -0.03293982 |
| Sum | 9370880 |
| Variance | 612.5162 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 1960 | 629 | |
| 1950 | 622 | |
| 1990 | 604 | |
| 1970 | 597 | |
| 1940 | 529 | |
| 1980 | 510 | |
| 2000 | 498 | |
| 1930 | 345 | |
| 2010 | 227 | 4.8% |
| 1920 | 175 | 3.7% |
| Other values (3) | 29 | 0.6% |
| Value | Count | Frequency (%) |
| 1900 | 2 | < 0.1% |
| 1910 | 13 | 0.3% |
| 1920 | 175 | 3.7% |
| 1930 | 345 | |
| 1940 | 529 | |
| 1950 | 622 | |
| 1960 | 629 | |
| 1970 | 597 | |
| 1980 | 510 | |
| 1990 | 604 |
| Value | Count | Frequency (%) |
| 2020 | 14 | 0.3% |
| 2010 | 227 | 4.8% |
| 2000 | 498 | |
| 1990 | 604 | |
| 1980 | 510 | |
| 1970 | 597 | |
| 1960 | 629 | |
| 1950 | 622 | |
| 1940 | 529 | |
| 1930 | 345 |
year
Real number (ℝ)
| Distinct | 110 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1971.2109 |
| Minimum | 1908 |
|---|---|
| Maximum | 2021 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 74.5 KiB |
Quantile statistics
| Minimum | 1908 |
|---|---|
| 5-th percentile | 1931 |
| Q1 | 1951 |
| median | 1970 |
| Q3 | 1992 |
| 95-th percentile | 2010 |
| Maximum | 2021 |
| Range | 113 |
| Interquartile range (IQR) | 41 |
Descriptive statistics
| Standard deviation | 24.533143 |
|---|---|
| Coefficient of variation (CV) | 0.012445722 |
| Kurtosis | -0.96211932 |
| Mean | 1971.2109 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | -0.025550453 |
| Sum | 9392820 |
| Variance | 601.87511 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 1947 | 78 | 1.6% |
| 1948 | 78 | 1.6% |
| 1962 | 76 | 1.6% |
| 1972 | 74 | 1.6% |
| 1989 | 73 | 1.5% |
| 1946 | 73 | 1.5% |
| 1951 | 72 | 1.5% |
| 1970 | 71 | 1.5% |
| 1950 | 70 | 1.5% |
| 1994 | 70 | 1.5% |
| Other values (100) | 4030 |
| Value | Count | Frequency (%) |
| 1908 | 1 | < 0.1% |
| 1909 | 1 | < 0.1% |
| 1912 | 1 | < 0.1% |
| 1913 | 1 | < 0.1% |
| 1915 | 1 | < 0.1% |
| 1916 | 1 | < 0.1% |
| 1918 | 1 | < 0.1% |
| 1919 | 8 | |
| 1920 | 17 | |
| 1921 | 11 |
| Value | Count | Frequency (%) |
| 2021 | 6 | 0.1% |
| 2020 | 8 | 0.2% |
| 2019 | 13 | |
| 2018 | 19 | |
| 2017 | 15 | |
| 2016 | 22 | |
| 2015 | 18 | |
| 2014 | 22 | |
| 2013 | 25 | |
| 2012 | 24 |
Correlations
| | all_aboard | passenger_aboard | crew_aboard | all_fatalities | passenger_fatalities | crew_fatalities | decade | year |
|---|---|---|---|---|---|---|---|---|
| all_aboard | 1.000 | 0.441 | 0.483 | 0.083 | 0.416 | 0.283 | 0.172 | 0.171 |
| passenger_aboard | 0.441 | 1.000 | 0.123 | 0.299 | 0.784 | 0.038 | 0.075 | 0.073 |
| crew_aboard | 0.483 | 0.123 | 1.000 | 0.166 | 0.111 | 0.679 | 0.062 | 0.061 |
| all_fatalities | 0.083 | 0.299 | 0.166 | 1.000 | 0.448 | 0.279 | 0.068 | 0.070 |
| passenger_fatalities | 0.416 | 0.784 | 0.111 | 0.448 | 1.000 | 0.169 | 0.072 | 0.068 |
| crew_fatalities | 0.283 | 0.038 | 0.679 | 0.279 | 0.169 | 1.000 | 0.023 | 0.021 |
| decade | 0.172 | 0.075 | 0.062 | 0.068 | 0.072 | 0.023 | 1.000 | 0.994 |
| year | 0.171 | 0.073 | 0.061 | 0.070 | 0.068 | 0.021 | 0.994 | 1.000 |
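The high-correlation alerts can be related to this matrix by listing every pair of variables whose coefficient reaches a cut-off. The sketch below does that with the values above. The cut-off of 0.5 is an assumption, since the report does not state the threshold it used, and the variable names `THRESHOLD` and `high_pairs` are illustrative only.

```python
names = [
    "all_aboard", "passenger_aboard", "crew_aboard", "all_fatalities",
    "passenger_fatalities", "crew_fatalities", "decade", "year",
]
# Correlation matrix copied row by row from the table above.
matrix = [
    [1.000, 0.441, 0.483, 0.083, 0.416, 0.283, 0.172, 0.171],
    [0.441, 1.000, 0.123, 0.299, 0.784, 0.038, 0.075, 0.073],
    [0.483, 0.123, 1.000, 0.166, 0.111, 0.679, 0.062, 0.061],
    [0.083, 0.299, 0.166, 1.000, 0.448, 0.279, 0.068, 0.070],
    [0.416, 0.784, 0.111, 0.448, 1.000, 0.169, 0.072, 0.068],
    [0.283, 0.038, 0.679, 0.279, 0.169, 1.000, 0.023, 0.021],
    [0.172, 0.075, 0.062, 0.068, 0.072, 0.023, 1.000, 0.994],
    [0.171, 0.073, 0.061, 0.070, 0.068, 0.021, 0.994, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, not stated in the report

# The matrix should be symmetric, so only the upper triangle is scanned.
is_symmetric = all(
    matrix[i][j] == matrix[j][i]
    for i in range(len(names))
    for j in range(len(names))
)

high_pairs = [
    (names[i], names[j], matrix[i][j])
    for i in range(len(names))
    for j in range(i + 1, len(names))
    if abs(matrix[i][j]) >= THRESHOLD
]

for left, right, value in high_pairs:
    print(f"{left} ~ {right}: {value:.3f}")
```

Under that assumed cut-off, the three pairs printed are the same three pairs named in the high-correlation alerts.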
First rows
| | datetime | location | operator | ac_type | all_aboard | passenger_aboard | crew_aboard | all_fatalities | passenger_fatalities | crew_fatalities | ground | decade | year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1908-09-17 | Fort Myer, Virginia | Military - U.S. Army | Wright Flyer III | 2 | 1 | 1 | 1 | 1 | 0 | 0 | 1900 | 1908 |
| 1 | 1909-09-07 | Juvisy-sur-Orge, France | ? | Wright Byplane | 1 | 0 | 1 | 1 | 0 | 0 | 0 | 1900 | 1909 |
| 2 | 1912-07-12 | Atlantic City, New Jersey | Military - U.S. Navy | Dirigible | 5 | 0 | 5 | 5 | 0 | 5 | 0 | 1910 | 1912 |
| 3 | 1913-08-06 | Victoria, British Columbia, Canada | Private | Curtiss seaplane | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 1910 | 1913 |
| 6 | 1915-03-05 | Tienen, Belgium | Military - German Navy | Zeppelin L-8 (airship) | 41 | 0 | 41 | 17 | 0 | 17 | 0 | 1910 | 1915 |
| 10 | 1916-10-01 | Potters Bar, England | Military - German Navy | Zeppelin L-31 (airship) | 19 | 0 | 19 | 19 | 0 | 19 | 0 | 1910 | 1916 |
| 23 | 1918-12-16 | Elizabeth, New Jersey | US Aerial Mail Service | De Havilland DH-4 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 1910 | 1918 |
| 24 | 1919-05-25 | Cleveland, Ohio | US Aerial Mail Service | De Havilland DH-4 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 1910 | 1919 |
| 25 | 1919-07-19 | Dix Run, Pennsylvania | US Aerial Mail Service | De Havilland DH-4 | 1 | 0 | 1 | 1 | 0 | 1 | 0 | 1910 | 1919 |
| 27 | 1919-08-02 | Verona, Italy | Caproni Company | Caproni Ca.48 | 14 | 12 | 2 | 14 | 12 | 2 | 0 | 1910 | 1919 |
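One sanity check these sample rows invite is whether the totals equal the sum of their passenger and crew parts. That the totals are meant to be such sums is an assumption, not something the report states. The sketch below applies the check to the ten rows above and lists the row indices where it fails.

```python
# (index, all_aboard, passenger_aboard, crew_aboard,
#  all_fatalities, passenger_fatalities, crew_fatalities)
# copied from the ten sample rows above.
first_rows = [
    (0, 2, 1, 1, 1, 1, 0),
    (1, 1, 0, 1, 1, 0, 0),
    (2, 5, 0, 5, 5, 0, 5),
    (3, 1, 0, 1, 1, 0, 1),
    (6, 41, 0, 41, 17, 0, 17),
    (10, 19, 0, 19, 19, 0, 19),
    (23, 1, 0, 1, 1, 0, 1),
    (24, 1, 0, 1, 1, 0, 1),
    (25, 1, 0, 1, 1, 0, 1),
    (27, 14, 12, 2, 14, 12, 2),
]

# Rows where a total differs from passengers + crew.
mismatched = [
    idx
    for idx, aboard, pax, crew, fatal, pax_fatal, crew_fatal in first_rows
    if aboard != pax + crew or fatal != pax_fatal + crew_fatal
]

print(mismatched)
```

Among these rows only index 1 fails the check (one fatality in total, none recorded for passengers or crew), so totals and parts may need reconciling before further analysis.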
Last rows
| | datetime | location | operator | ac_type | all_aboard | passenger_aboard | crew_aboard | all_fatalities | passenger_fatalities | crew_fatalities | ground | decade | year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4997 | 2020-05-22 | Karachi, Pakistan | Pakistan International Airline | Airbus A320-214 | 99 | 91 | 8 | 97 | 89 | 8 | 1 | 2020 | 2020 |
| 4998 | 2020-08-07 | Calicut, India | Air India Exppress | Boeing 737-8HG | 190 | 184 | 6 | 20 | 18 | 2 | 0 | 2020 | 2020 |
| 4999 | 2020-08-22 | Juba, South Sudan | South West Aviaiton | Antonov 26B | 8 | 5 | 3 | 7 | 4 | 3 | 0 | 2020 | 2020 |
| 5000 | 2020-09-25 | Near Chuguev, Ukraine | Military - Ukraine Air Force | Antonov An26SH | 27 | 20 | 7 | 26 | 19 | 7 | 0 | 2020 | 2020 |
| 5001 | 2021-01-09 | Near Jakarta, Indonesia | Sriwijaya Air | Boeing 737-524 | 62 | 56 | 6 | 62 | 56 | 6 | 0 | 2020 | 2021 |
| 5002 | 2021-03-02 | Pieri, Sudan | South Sudan Supreme Airlines | Let L-410UVP-E | 10 | 8 | 2 | 10 | 8 | 2 | 0 | 2020 | 2021 |
| 5003 | 2021-03-28 | Near Butte, Alaska | Soloy Helicopters | Eurocopter AS350B3 Ecureuil | 6 | 5 | 1 | 5 | 4 | 1 | 0 | 2020 | 2021 |
| 5004 | 2021-05-21 | Near Kaduna, Nigeria | Military - Nigerian Air Force | Beechcraft B300 King Air 350i | 11 | 7 | 4 | 11 | 7 | 4 | 0 | 2020 | 2021 |
| 5005 | 2021-06-10 | Near Pyin Oo Lwin, Myanmar | Military - Myanmar Air Force | Beechcraft 1900D | 14 | 12 | 2 | 12 | 11 | 1 | 0 | 2020 | 2021 |
| 5007 | 2021-07-06 | Palana, Russia | Kamchatka Aviation Enterprise | Antonov An 26B-100 | 28 | 22 | 6 | 28 | 22 | 6 | 0 | 2020 | 2021 |
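The near-perfect correlation between decade and year (0.994) is what one would expect if decade is simply the year rounded down to a multiple of ten. The report does not say how decade was built, so the sketch below is an assumption, checked against the (year, decade) pairs visible in the sample rows; `decade_of` is a hypothetical helper.

```python
def decade_of(year):
    """Round a year down to the start of its decade, e.g. 1919 -> 1910."""
    return year // 10 * 10


# (year, decade) pairs copied from the first and last sample rows.
sample_pairs = [
    (1908, 1900), (1909, 1900), (1912, 1910), (1913, 1910), (1915, 1910),
    (1916, 1910), (1918, 1910), (1919, 1910), (2020, 2020), (2021, 2020),
]

# True if the assumed rule reproduces every decade value in the samples.
rule_holds = all(decade_of(year) == decade for year, decade in sample_pairs)
print(rule_holds)
```

If the rule holds for the full dataset, decade carries no information beyond year, and one of the two could be dropped before modelling.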